Three dimensions of the so-called "interoperability" of annotation schemes
Abstract
The “interoperability” of annotation schemes is one of the key terms in discussions of corpus annotation. In the present contribution, we propose to look at this so-called interoperability from (at least) three angles: (i) as the relation (and possible interaction or cooperation) between different annotation schemes for different layers or phenomena of a single language, (ii) as the possibility of annotating different languages with a single annotation scheme, modified or not, and (iii) as the relation between different annotation schemes for a single language, or for a single phenomenon or layer of the same language. The pros and cons of each of these aspects are discussed, as well as their contribution to linguistic studies and natural language processing. It is stressed that communication and collaboration between different annotation schemes require that each scheme be explicitly specified and consistent.
Similar resources
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
Annotation schemes, annotation tools and the question of interoperability: from Typed Feature Structures to XML Schemas
The multiplication of annotation schemes (each project proposing its own) and that of coding formats (one per annotation tool) is a severe limitation for interoperability. We propose in this paper an approach relying on a high-level and format-free representation of information, before any coding process. This approach consists in specifying the annotation scheme in terms of typed feature struc...
Annotation in Architecture: A Systematic Approach toward Mobilization and Development of Theoretical, Research, and Critical Basis in Architecture
Annotations usually refer to marginal notes that explain a difficult or ambiguous subject, provide a general definition or a critical remark for a particular part of a text. Historically, annotating was a well-known tradition in Islamic sciences and was used especially in times when there were less new potentials for generating new knowledge. The main question of this research is, can the tradi...
Integrating Annotation Tools into UIMA for Interoperability
In this paper, we discuss the issue of implementing the interoperability of natural language annotation tools for text mining with the Unstructured Information Management Architecture (UIMA) (Ferrucci and Lally, 2004; http://incubator.apache.org/uima). In particular, we discuss the practical issue of designing UIMA annotation schemes for text mining applications based on our experience in the E...
Towards Interoperability for the Penn Discourse Treebank
The recent proliferation of diverse types of linguistically annotated schemes coded in different representation formats has led to efforts to make annotations interoperable, so that they can be effectively used towards empirical NL research. We have rendered the Penn Discourse Treebank (PDTB) annotation scheme in an abstract syntax following a formal generalized annotation scheme methodology, t...
Publication date: 2014